Fragments of nucleic acids specific to mycobacteria which are members of the M. tuberculosis complex and their applications for the detection and the differential diagnosis of members of the M. tuberculosis complex

ABSTRACT

A fragment of a nucleic acid specific to mycobacteria of  M. tuberculosis  complex having a nucleotide sequence of SEQ ID No: 1 and SEQ ID No: 2 and their complimentary sequences.

[0001] The present invention relates to sequences of nucleic acids of mycobacteria belonging to the M. tuberculosis complex.

[0002] The invention likewise relates to sequences of in vitro detection of strains of mycobacteria belonging to the M. tuberculosis complex as well as to a method for a differential diagnosis of strains of the M. tuberculosis complex, especially for differentiating the presence of the BCG from that of other members of the complex in a sample.

[0003] Approximately, 1.7 billion people or ⅓ of the world population are infected with M. tuberculosis (Sudre et al., 1992). In 1990, the estimated number of cases of tuberculosis was 8 million, including 2.9 million deaths (Sudre et al., 1992). These last few years, the number of cases of tuberculosis in the Unites States and in Europe has increased by 3 to 6% per annum, principally in populations at high risk such as patients suffering from AIDS, chronic alcoholics, the homeless and drug addicts (Barnes et al., 1991).

[0004] Taking account of the difficulties in fighting infections by mycobacteria, there is an urgent need to be able to have a specific and sensitive rapid method allowing these infections to be diagnosed. Also, the early detection of M. tuberculosis in clinical samples is taking on growing importance in the control of tuberculosis both for the clinical treatment of infected patients and for the identification of exposed individuals at risk.

[0005] Detection by PCR (Polymerase Chain Reaction) of specific DNA of species of mycobacteria is probably one of the most promising novel approaches for rapid, specific and sensitive diagnosis (Saiki et al., 1985: Brisson-Noël et al., 1989; Cousins et al., 1992; Eisenach et al., 1990; Forbes et al., 1993; Fries et al., 1991; Hermans et al., 1990; Jonas et al., 1993; Kolk et al., 1992: Pierre et al., 1991; Saboor et al., 1992; Shankar et al., 1991; Sjöbring et al., 1990). The different studies carried out up to date, however, have led to different results as far as the specificity and the sensitivity are concerned. Among the reasons for this diversity, it is especially possible to note methodological differences concerning the preparation of the samples, the protocol followed for the amplification or the methods of detection of the PCR products.

[0006] Several of these studies relate to the detection by PCR of the M. tuberculosis complex starting from the target DNA IS6110 (Clarridge et al., 1993; Eisenach et al., 1990; Forbes et al., 1993; Noerdhoeck et al., 1994), the gene coding for the 65 kDa antigen (Brisson-Noël et al., 1991; Telenti et al., 1993), the gene coding for the 38 kDa antigen (Folgueira et al., 1993: Sjöbring et al., 1990) or the ribosomal 16S RNA (Kox et al., 1995). However, all these diagnostic tests identify the M. tuberculosis complex in its entirety.

[0007] The M. tuberculosis complex comprises M. tuberculosis, M. bovis, M. microti and M. africanum. These four species have strong homologies in their DNA (85 to 100%) (Imeada, 1985) and have several totally identical genes. This homology has limited the use of DNA sequences to differentiate the strains. It would nevertheless be of particular interest to be able to differentiate the strains of M. tuberculosis from those of the Calmette-Guérin (BCG) bacillus M. bovis, the latter often being used as live vaccines for immunoprotection against tuberculosis. Obviously, there is thus an interest in distinguishing between possibly pathogenic mycobacteria and vaccinating BCG strains. This distinction is particularly important in the case of immunodeficient individuals, such as subjects infected by HIV.

[0008] Restriction fragment length polymorphism (RFLP) analyses based on the insertion of the IS6110 elements have been used to differentiate different strains of M. tuberculosis. It has been shown that this insertion element allowed specific RFLP profiles of strains to be obtained on account of a variability in its localization and in the number of copies existing in the genomes of different strains. IS6110 has been demonstrated in M. tuberculosis and in M. bovis but not in the other mycobacteria tested (Cave et al., 1991). In general, several copies of this insertion element can be demonstrated in M. tuberculosis, although a single copy is found in M. bovis BCG (Cave et al., 1991). However, certain strains of M. tuberculosis are devoid of IS6110 (van Soolingen et al., 1994) and strains of M. bovis having a significant number of copies are common in some populations (van Soolingen et al., 1994). Added to these limitations is the fact that the identification of strains of M. tuberculosis from IS6110 necessitates Southern Blot analyses and cannot easily be carried out by routine PCR methods.

[0009] The authors of the present invention have demonstrated a novel target DNA sequence for enzymatic amplification. This sequence is part of an operon coding for a specific two-component regulatory system M. leprae and bacteria which are members of the M. tuberculosis complex. The two-component systems are regulatory systems belonging to a large family, involving two proteins which cooperate in translating external signals by modifications in the level of genetic expression (Parkinson and Kofoid, 1992). According to a proposed general model, one component localized in the membrane could act as a sensor of environmental changes and could transmit the information to a regulatory component of the response in the cytoplasm, which in turn could modulate the transcription of target genes. The communication between the two components is generally carried out by a cascade of phosphorylations.

[0010] The authors of the present invention have at present cloned and characterized a two-component mycobacterial system. This system appears to be specific to members of the M. tuberculosis complex and to M. leprae. Unexpectedly with respect to the other two-component systems, the genes of this mycobacterial system are separated by DNA sequences (intercistronic sequences or repeated sequences) which are uniquely present among the strains of mycobacteria of the M. tuberculosis complex and in M. leprae.

[0011] The authors of the present invention have additionally shown that in the strains of the M. tuberculosis complex, this intercistronic region corresponded to an exact or truncated number of repetitions of a sequence of 77 base pairs, SEQ ID No. 1, which was able to vary among the strains. The truncated sequence (SEQ ID No. 2) is composed of 53 base pairs and contains a short in-phase internal deletion of nucleotides Nos. 40 to 66, which are substituted by a GAG codon.

[0012] The authors of the invention have likewise shown that, surprisingly, the truncated sequence (SEQ ID No. 2) was characteristic of the members of the M. tuberculosis complex which are different from BCG.

[0013] In M. leprae, the corresponding intercistronic region is formed by a variant of 52 base pairs already additionally described and whose sequence is indicated below:

[0014] 5′ atg aca ccc gcg cag gcg atg atg cag agc gaa gtg acg aga ggg aat gtg a 3′.

[0015] Taking account of their characteristics, these repeated sequences are of particular interest for identifying and differentiating between strains of the M. tuberculosis complex by enzymatic amplification techniques, such as PCR or other similar methods. More particularly, they allow a differential diagnosis between the presence of BCG and that of other members of the complex in a biological sample to be established. This differential diagnosis, whose principle is based on the specific detection of the sequence SEQ ID No. 2, forms a preferred aspect of the invention. It is advantageously employed to distinguish an infection by BCG from an infection by other virulent members of the complex in immunodefficient individuals, such as especially subjects infected by HIV.

[0016] The invention thus relates to a specific fragment of nucleic acids of mycobacteria belonging to the M. tuberculosis complex, comprising a sequence of nucleotides chosen from amongst the sequence SEQ ID No. 1, the sequence SEQ ID No. 2, their complementary sequences or the sequences of nucleic acids capable of hybridizing with one of the preceding sequences under conditions of high stringency.

[0017] It is likewise aimed at the use of the said sequences for the production of nucleotide probes for the in vitro detection of mycobacteria belonging to the M. tuberculosis complex and of oligonucleotide primers for the enzymatic amplification of specific sequences of strains which are members of the M. tuberculosis complex.

[0018] The invention likewise relates to a method allowing tho strains of the M. tuberculosis complex to be differentiated from one another, particularly for allowing BCG to be differentiated from pathogenic mycobacteria of this complex.

[0019] It likewise provides means for differentiating sub-groups among the strains forming the M. tuberculosis complex.

[0020] In addition, it can allow M. leprae of the species and of the strains belonging to the M. tuberculosis complex to be differentiated in a biological sample.

[0021] In the context of the present invention, it is considered that the conditions of “high stringency” in which two nucleotide sequences can hybridize are the conditions defined by Sambrook et al., 1989, namely temperature conditions of between (T_(m) minus 5° C.) and (T_(m) minus 30° C.) and additionally preferably temperature conditions of between (T_(m) minus 5° C.) and (T_(m) minus 10° C.) (high stringency), T_(m) being the melting temperature, defined as being the temperature at which 50% of the paired strands separate. Preferentially, the conditions of high stringency used correspond to prehybridization, hybridization and washing temperatures of 68° C. and a prehybridization and hybridization buffer based on 5×SSC (protocol recommended by Boehringer Mannheim).

[0022] The invention relates to a specific fragment of nucleic acids of mycobacteria of the M. tuberculosis complex localized in the intercistronic region of the senX3-regX3 system.

[0023] As will be explained in the examples, the sequence SEQ ID No. 1 corresponds to an entity of 77 repeated base pairs and a different number of copies according to the strains of mycobacteria studied. This repeated sequence has an open reading frame (ORF) which can code for a peptide of 25 amino acids.

[0024] The invention is thus aimed at a fragment of nucleic acids specific to the M. tuberculosis complex, comprising a sequence of nucleotides selected from the sequence SEQ ID No. 1, its complementary sequence or the sequence of nucleic acids capable of hybridizing with one of the preceding sequences under conditions or high stringency.

[0025] According to another aspect, the invention is aimed at a specific fragment of nucleic acids of members of the M. tuberculosis complex which are different from BCG, especially the virulent species M. tuberculosis, M. africanum or M. bovis, comprising a sequence of nucleotides chosen from amongst the sequence SEQ ID No. 2, its complementary sequence or the sequences of nuclear acids capable of hybridizing with one of the preceding sequences under conditions of high stringency.

[0026] As is actually illustrated by the examples, the BCG strains can be differentiated from all the other strains of the M. tuberculosis complex because of the absence of the sequence SEQ ID No. 1 in the senX3-regX3 intergenic region in BCG (see Table 3, groups 4, 6, 8).

[0027] The different nucleotide sequences of the invention can be of artificial origin or non-artificial origin. They can be DNA or RNA sequences.

[0028] They can be prepared, for example, by chemical synthesis, or alternatively by mixed methods including chemical or enzymatic modifications of sequences obtained by screening sequence libraries, by means of probes elaborated on the basis of the sequences SEQ ID No. 1 or 2. Such libraries can be prepared by the classical techniques of molecular biology, known to the person skilled in the art.

[0029] The nucleotide sequences of the invention allow the production of probes or of nucleotide primers capable of hybridizing specifically with these, their corresponding RNA sequences or the corresponding genes under conditions of high stringency. Such probes are likewise part of the invention. They can be used as an in vitro diagnostic tool for the detection, by hybridization experiments, of specific nucleic acid sequences of mycobacteria belonging to the M. tuberculosis complex.

[0030] Preferentially, the probes of the invention have at least 24 consecutive nucleotides although shorter probes can be equally suitable. At the maximum, these have the complete senX3-regX3 intergenic region of M. tuberculosis (IPL), namely two successive sequences SEQ ID No. 1 followed by a sequence SEQ ID No. 2. This DNA fragment contains 218 base pairs.

[0031] Among the specific preferred nucleotide probes of the members of the M. tuberculosis complex, the sequence SEQ ID No. 1 in its entirety or its complementary sequence appear especially.

[0032] According to another aspect, the invention is aimed at nucleotide probes for the detection and the demonstration of sequence or nucleic acids specific to members of the M. tuberculosis complex which are different from BCG, especially the virulent species M. tuberculosis, M. africanum or M. bovis, as well as for the differential diagnosis of the presence of BCG in a biological sample containing mycobacteria of the M. tuberculosis complex.

[0033] Such nucleotide probes are obtained from a region of the sequence SEQ ID No. 2 surrounding the GAG codon in position 40-42 or of its complementary strand. Preferentially, this region has a length of 21 base pairs, and contains 9 nucleotides upstream and downstream of the specific GAG codon of the sequence SEQ ID No. 2.

[0034] Advantageously, these probes comprise the sequence SEQ ID No. 2 in its entirety or its complementary sequence.

[0035] The probes of the invention hybridize according to the appropriate hybridization conditions, corresponding to the conditions of temperature and of ionic strength usually used by the person skilled in the art.

[0036] Preferentially, the probes of the invention are labelled, previously to their use. For this, several techniques are at the disposal of the person skilled in the art (fluorescent, radioactive, chemiluminiscent, enzymatic, etc. labelling).

[0037] According to a preferred embodiment, the probes are labelled with digoxygenin. Digoxygenin (DIG) is a steroid hapten coupled to dUTP for labelling DNA used as a probe. This technology is marketed by Boehringer Mannheim. After the hybridization with the probe and washing, the detection is carried out following an emission of a chemoluminescent signal by means of the substrate disodium 3-(4-methoxyspiro{1,2 dioxethane-3,2′-(5-chloro)tricyclodecan}-4-yl)phenylphosphate (CSPD).

[0038] The nucleotide sequences of the invention are likewise useful for the production and the use of oligonucleotide primers for sequencing reactions or for enzymatic amplification.

[0039] The enzymatic amplification techniques are principally illustrated by PCR. Other similar methods can, however, be used such as, for example, LCR (Ligase Chain Reaction), NASBA (Nucleic Acid Sequence Based Amplification), Q-βreplicase, SDA (Strand Displacement Amplification) and any other variant comprised in the technological knowledge of the person skilled in the art. These nucleic acid amplification techniques use oligonucleotide primer molecules to initiate the elongation reaction of the target sequence. The exact length of these primers will be able to vary according to the case. For example, as a function of the complexity of the sequence of the matrix, a polynucleotide primer typically contains from 15 to 25 nucleotides or more. In certain cases, however, it can contain less.

[0040] Preferentially, the nucleotide primers of the invention comprise at least 19 nucleotides. They are formed by primers chosen from sequences adjacent to the senX3-regX3 intergenic region, in the regions 3′ of senX3 and 5′ of regX3.

[0041] According to a preferred variant, the pair of primers referred to as C5 and C3 are used, namely 5′GCGCGAGAGCCCGAACTGC3′ and 5′GCGCAGCAGAAACGTCAGC3′ corresponding respectively to the 3′ end of the senX3 gene and the 5′ end of the regX3 gene. These primers respectively hybridize 56 base pairs upstream of the intercistronic region and 62 base pairs downstream of the latter.

[0042] The nucleotide sequences and the probes resulting therefrom can be cloned in cloning and/or expression vectors according to classical techniques of molecular biology, especially involving the use of restriction enzymes and of specific cleavage sites.

[0043] A preferred cloning vector in the present invention is represented by the plasmid pRegX3Mt1 deposited at the CNCM under the number I-1766 whose construction is described in greater detail in the examples below.

[0044] Another cloning vector according to the invention is represented by the plasmid pRegX3Bcl deposited at the CNCM under the number I-1765. These plasmids have each been introduced into the bacterium E. coli XL1-blue.

[0045] The vectors I-1765 and I-1766 each contain the complete senX3-regX3 genes with the intercistronic regions of BCG and of M. tuberculosis respectively.

[0046] The nucleotide sequences according to the invention can additionally be expressed in appropriate systems, for the production and the study of the biological activities of the corresponding peptides. In this case, these will be placed under the control of signals allowing their expression in a cell host.

[0047] An efficient system of production of a protein or of a recombinant peptide necessitates having a vector, for example of plasmid or viral origin, and a compatible host cell.

[0048] The cell host can be chosen from prokaryotic systems, like bacteria, or eukaryotic systems, for example like yeasts, insect cells, CHO (Chinese hamster ovary) cells or any other system advantageously available.

[0049] The vector must contain a promoter, translation initiation and termination signals, as well as the appropriate regions of transcription regulation. It must be able to be maintained stably in the cell and can possibly have particular signals specifying the secretion of the translated protein.

[0050] These different control signals are chosen as a function of the cell host used. To this end, the nucleotide sequences according to the invention can be inserted into autonomic replication vectors in the chosen host, or integrative vectors of the chosen host. Such vectors will be prepared according to the methods currently used by the person skilled in the art, and the clones resulting therefrom can be introduced into an appropriate host by standard methods, such as, for example, electroporation.

[0051] The in vitro diagnostic or detection methods in which the nucleotide probes obtained from sequences of the invention are employed are likewise part of the present invention.

[0052] More particularly, the invention is aimed at a method of detection of strains of mycobacteria belonging to the M. tuberculosis complex in a biological sample comprising the following steps:

[0053] (i) contacting the biological sample with a pair of primers under conditions allowing hybridization of the said primers to the nucleic acids specific to strains of mycobacteria belonging to the M. tuberculosis complex;

[0054] (ii) amplification of the said nucleic acids;

[0055] (iii) contacting a nucleotide probe according to the invention with the said biological sample under conditions allowing the formation of hybridization complexes between the said probe and the amplified nucleic acid sequences;

[0056] (iv) detection of the hybridization complexes formed.

[0057] According to a first variant, the method according to the invention allows the presence of any member of the M. tuberculosis complex to be detected in a biological sample. In this case, the complexes of step (iii) defined above are formed with a specific nucleotide probe of the sequence SEQ ID No. 1 or of its complementary strand.

[0058] According to a second advantageous variant, the method of the invention allows the presence of members of the M. tuberculosis complex other than BCG to be specifically detected. According to this variant, the complexes of step (iii) which are detected are formed with a nucleotide probe specific to the sequence SEQ ID No. 2 or to its complementary strand, consisting preferentially of a short sequence compound of two times 9 base pairs framing the GAG codon at the specific positions 40 to 42 of the sequence SEQ ID No. 2.

[0059] According to a third particularly advantageous variant, the method of the invention allows differential diagnosis of the BCG and of other members of the complex. In this case, the method consists at first of demonstrating nucleic acids specific to all the members of the M. tuberculosis complex by detection according to step (iv) of the hybridization complexes formed with a first specific nucleotide probe of the sequence SEQ ID No. 1 or of its complementary strand, then in finding among the amplified nucleic acids capable of forming complexes with the said first probe those which can likewise form complexes with a second specific nucleotide probe of SEQ ID No. 2 or of its complementary strand.

[0060] The amplified nucleic acid sequences hybridizing uniquely with the first probe correspond to specific nucleic acid sequence of the BCG, although the sequences hybridizing with each of the two probes correspond to sequences of the other members of the M. tuberculosis complex.

[0061] This method of differential diagnosis is of obvious interest with respect to the conventional methods of detection, because it allows an infection by BCG (possibly from a vaccination) to be differentiated from that by a virulent mycobacterium of the M. tuberculosis complex (M. tuberculosis, M. bovis or M. africanum, and possibly M. microti). This distinction is particularly important in immunodeficient individuals, especially subjects infected by HIV.

[0062] According to another preferred embodiment, the invention provides a method of identification of groups of mycobacteria belonging to the M. tuberculosis complex, characterized in that:

[0063] the DNA of the said strains previously extracted with a pair of primers such as defined above is contacted under conditions allowing a specific hybridization of the said primers with their corresponding sequences on the DNA of the said strains and the obtainment of amplification products, and

[0064] the length of the amplification products obtained is measured, for example by agarose gel electrophoresis.

[0065] Advantageously, the primers 5′GCGCGAGAGCCCGAACTGC3′ and 5′GCGCAGCAGAAACGTCACC3′ are used in this method.

[0066] The invention likewise relates to a kit for the in vitro identification of strains of mycobacteria belonging to the M. tuberculosis complex in a biological sample comprising:

[0067] a pair of primers according to the invention, as defined above;

[0068] the reagents necessary to allow the amplification of the specific sequences of nucleic acids belonging to the M. tuberculosis complex with the aid of the said primers,

[0069] possibly means for revealing the amplified fragments, preferentially a nucleotide probe of the invention.

[0070] Other characteristics and advantages of the invention are illustrated by the example following the description as well as by the figures whose legends are indicated below.

[0071]FIG. 1: “Southern Blot” analysis of the DNA of BCG (IPP). A). The fragment of 259 base pairs comprising a part of the regX3 gene was used as a probe to detect the gene in different restriction fragments in the chromosomal DNA of the BCG (IPP). The restriction enzymes used are indicated at the top of the figure.

[0072] B). Partial restriction map of the locus of the senX3 and regX3 genes of the BCG (IPP). The restriction sites are indicated with respect to the senX3 and regX3 genes, as well as the probe used.

[0073]FIG. 2: Nucleotide sequence of the senX3 and regX3 genes of the BCG (IPP) and derived protein sequences. The arrows indicate palindromic sequences. The putative Shin-Dalgarno sequence is underlined in dots (SD). The predicted transmembrane sequence of SenX3 is doubly underlined. The regions of SenX3 conserved in other sensors are underlined and annotated: H, region, containing modified histidine; N, region rich in asparagine; F, region rich in phenylalanine; G1 and G2, regions rich in glycine. The small vertical arrows indicate the residues which are predicted to be phosphorylated. The annotated sequence PgmY indicates the end of the gene encoding phosphoglycerate mutase.

[0074]FIG. 3: hydrophobicity profile of SenX3. The positive values indicate hydrophobic regions and the negative values indicate hydrophilic regions. The arrows indicate the possible initiation codons and the two horizontal lines indicate predicted transmembrane regions. The numerals at the top of the figure indicate the numbers of amino acids.

[0075]FIG. 4: comparison of the intercistronic region between the BCG (IPP) and M. tuberculosis (IPL). The senX3 and regX3 genes are indicated by the arrows. The nucleotide sequences are indicated as well as the inferred protein sequences.

[0076]FIG. 5: Southern blot analysis of DNA of M. tuberculosis and of BCG. The chromosomal DNA of M. tuberculosis (IPL) (on the right) and of BCG (IPP) (on the left), was digested by PstI, subjected to electrophoresis in agarose and analyzed by hybridization with the senX3-regX3 intergenic probe of M. tuberculosis (IPL).

[0077]FIG. 6: analysis by electrophoresis in agarose (2.5%) of the products obtained by PCR carried out on different mycobacterial strains. Lanes 1 to 3: group 1, respectively M. microti, M. tuberculosis V808, M. tuberculosis V761; lanes 4 and 5, group 2, respectively M. tuberculosis V729, M. bovis 60; lanes 6 to 8, group 3, respectively M. bovis 63, M. bovis 78 and M. bovis AN5; lanes 9 and 10, group 4, respectively M. tuberculosis H37Ra (IPL) and M. tuberculosis H37Rv (IPP); lanes 11 and 12, group 5, respectively M. tuberculosis (IPL), M. bovis 76; lanes 13 and 14, group 6, respectively M. bovis (BCGite 29) and M. bovis BCG (IPP), lane 15, group 7, M. tuberculosis No. 19.

[0078]FIG. 7: Southern blot analysis of the fragments obtained in FIG. 6. The probe used is the senX3-regX3 intercistronic region of M. tuberculosis (IPL).

EXAMPLES Example 1 Genomic Map and Cloning of the senX3-regX3 Genes of M. bovis BCG (IPP), Coding for the Two-Component Mycobacterial System

[0079] Wren et al. (1992) amplified a fragment of 259 base pairs from the gene of M. tuberculosis (IPL) which they called regX3.

[0080] The authors of the present invention amplified the corresponding sequence from M. bovis BCG (vaccine strain 1173P2, obtained at the WHO collection of Stockholm, Sweden) using the following synthetic oligonucleotides: 5′-CGAGGAGTCCCTCGCCGATCCGC-3′ and 5′-AGCGCCCCAGCTCCAGCCGACC-3′. The amplification was carried out by using these primers and a Deep Vent polymerase (New England Biolabs). The amplification product of 259 base pairs resulting was then purified by electrophoresis on agarose gel and then sub-cloned in the Smal site of the pBluescript KS+ vector (Stratagene) according to standard protocols (Sambrook et al., 1989). This insert was then removed from the recombinant plasmid by digestion with BamHI and EcoRI and then labelled with dCTP [α-³²P] by “random priming” (random labelling) using the “random priming” kit marketed by Boehringer according to the conditions recommended by the manufacturer. The genomic DNA of M. bovis BCG (IPP), cultured in Sauton medium (Sauton, 1912) at 37° C. in flasks for stationary culture of tissues, was extracted as described previously (Kremer et al., 1995a) and digested by different restriction enzymes. This DNA was then subjected to electrophoresis on agarose gel and to Southern Blot analysis according to the standard methods (Sambrook et al., 1989). The testing of the blot with the [³²P]-labelled DNA showed that digestion by KpnI with either BamHI, EcoRI or PstI produced pairs of hybridizing bands, with one band labelled more intensely than the other in each of the pairs (FIG. Ia). These bands were attributed to the 5′ and 3′ sequences respectively, of the unique asymmetric KpnI site of the probe. This allowed the BamHI, EcoRI and PstI 5′ and 3′ sites of the KpnI site to be localized (FIG. Ib). The digestions with only PstI gave a single hybridization band. As could be predicted, the digestion with KpnI gave two bands and the digestion with BamHI and EcoRI led to the obtainment of a single band of approximately 3.5 kb.

[0081] The genomic DNA of the BCG was digested with BamHI and EcoRI, and DNA fragments of 3 to 4 kb were isolated after electrophoresis on agarose gel. These fragments were inserted into pBluescript SK (Stratagene) restricted by BamHI and EcoRI. 900 recombinant clones were screened with the [³²P]-labelled probe by the said standard technique of “colony blot hybridization” (Sambrook et al., 1989). Three of them were positive and turn out to contain the same BamHI/EcoRI DNA insert of 3.2 kb by restriction analysis. A hybridizing clone was isolated for the latter studies as was named pRegX3Bcl (I-1765).

Example 2 Sequence of the Genes of M. bovis BCG (IPP) Coding for the Two-Component System

[0082] The SalI fragments of 1.0, 1.5 and 0.7 kb were isolated from the 3.2 kb insert of pRegX3Bcl (I-1765), and sub-cloned in pBluescrip SK−. The sequence of the 3.2 kb insert, as well as that of the sub-clones SalI was determined on the two strands by the method of “primer walking”. The nucleotide sequence of 3208 bp of M. bovis BCG (IPP) (FIG. 2) showed the frequency of C+G characteristic of the mycobacteria (66.6%). Two principal open reading frames (OFT) designated by regX3 an senX3 were identified (FIG. Ib and 2). The regX3 gene starts with an ATG triplet at the 1679 position and ends with a TAG triplet at the 2360 position. It contains the sequence of the ³²P-labelled probe and codes for a protein whose inferred sequence of amino acids is 227 residues for a calculated molecular mass of 24,881 Da. The limit upstream of the senX3 ORF is not certain because five in-phase potential initiation codons (ATG or GTG) were found in a short distance (from 296 to 446) (FIGS. 2 and 3). However, only the GTG at position 296 is preceded with 9 nucleotides upstream, by sequence homologous to the Shine-Dalgarno sequence of Escherichia coli (4 of the 6 residues are identical to AGGAGG). Thus, this GTG codon could be the limit in 5′ of the senX3 ORF which ends with the TGA strop codon at the position 1526. This ORF is thus presumed to code for a protein of 44,769 Da composed of 410 amino acid residues. The senX3 gene is preceded at 273 nucleotides upstream by the 3′ part of an ORF encoding a homologue of the phosphoglycerate mutases. A sequence capable of forming a hairpin structure is localized between the positions 178 and 194. This sequence could function as a terminatory transcription sequence of the ORF upstream.

[0083] A search on the MycDB database showed that the senX3 and regX3 genes are also present in M. leprae. The products which they encode are respectively similar to 82.7% and 94.9% to the proteins senX3 and RegX3 of M. bovis (IPP). The senX3 ORF of M. leprae turns out to be initiated by a GTG codon and is preceded by an ORF coding for a homologue of phosphoglycerate mutase, similar to that which was found in M. bovis BCG (IPP).

Example 3 Cloning and Sequencing of the senX3 -regX3 Genes of M. tuberculosis (IPL)

[0084] The senX3 and regX3 genes of M. tuberculosis (IPL) were cloned by PCT from chromosomal DNA of M. tuberculosis 22962067, a clinical isolate from the collection of mycobacteria of the Institut Pasteur de Lille. The DNA of M. tuberculosis (IPL) was extracted by the method described above for the extraction of chromosomal DNA of the BCG (Kremer et al., 1995a). The fragment of 2.2 kb containing the senX3 and regX3 genes of M. tuberculosis (IPL) was amplified by PCR using the following primers, homologous to the adjacent sequences of the senX3 and regX3 genes of M. bovis BCG (IPP): 5′-TGGCGTAGTGTGTGACTTGTC-3′ and 5′ GACCAGACAGTCGCCAAGGTT-3′. The amplified fragment was cloned in the SmaI site of pBluescript SK− (Version II) to produce a plasmid named pRegX3Mt1 (I-1766). The total fragment of 2.2 kb was then sequenced using the same strategy as that described in Example 2 for the senX3 and regX3 genes of the BCG (IPP). The DNA sequence of the senX3 and regX3 ORF of M. tuberculosis (IPL) as well as the 5′ region upstream of senX3 and the 3′ region downstream of regX3 were identical to those of the BCG (IPP). However, the intercistronic region between senX3 and regX3 demonstrated interesting differences which are studied in the example below.

Example 4 Analysis of the senX3-regX3 Intercistronic Region

[0085] The intercistronic region between senX3 and regX3 contains a perfect duplication of 77 base pairs in tandem in the BCG (IPP) (FIG. 4). Each repeated sequence contains a short ORF which has the capacity to code for a peptide of 25 amino acids. The ATG initiation codon which can be inferred from the first repetition overlaps the TGA stop codon of senX3. This repeated sequence ends with two stop codons in-phase, which overlap the ATG initiation codon inferred from the following repetition. The ORF of the second repetition also ends with a double TGA stop codon, which overlaps the ATG start codon of regX3.

[0086] The intercistronic region of M. tuberculosis (IPL) is longer and contains a third repeated sequence which, however, is not complete. It contains a short in-phase internal deletion of the nucleotides 40 to 66 which are replaced by GAG. The ORF of this third repetition also ends with a double stop codon overlapping the ATG codon of regX3.

[0087] The sequence of the intercistronic region of M. leprae is shorter than that of BCG (IPP). It contains 52 base pairs.

[0088] The existance of these structures in repetition in the senX3-regX3 intercistronic region led the inventors to find out whether they were present in other regions of M. tuberculosis (IPL) or in the chromosome of M. bovis BCG (IPP) by Southern Blot analysis. The senX3-regX3 intercistronic region of M. tuberculosis (IPL) was obtained by enzymatic digestion of pRegX3Mt2 with EcoRI and BamHI. The resulting DNA fragment of 491 base pairs was then digested with BsrI-AluI to produce a DNA fragment of 218 base pairs which corresponds to the senX3-regX3 intergenic fragment of M. tuberculosis (IPL). This DNA fragment was then labelled at random with digoxygenin-dUTP (kit cat. No. 10093657) according to the recommendations of the manufacturer (Boehringer Mannheim). pRegX3Mt2 was produced by a first amplification by PCR of a fragment of chromosomal DNA of 471 base pairs of M. tuberculosis 22962067 (IPL) using the following oligonucleotides: 5′AAACACGTCGCGGCTAATCA 3′ and 5′CCTCAAAGCCCCTCCTTGCGC 3′ and the resulting amplified fragment was then cloned in the SmaI site of pBluescript KS−.

[0089] The chromosomal DNA of M. tuberculosis (IPL) was then totally digested with PstI and subjected to electrophoresis on agarose gel and analysed by Southern Blot according to the customary procedures (Sambrook et al., 1989). The labelled probe was then used for hybridization as indicated below.

[0090] The probe was at first denatured by boiling for 5 minutes, and the membrane was incubated with the probe overnight at 68° C., after a prehybridization for two hours at 68° C. The buffer used for the prehybridization and the hybridization was 5×SSC (Sambrook et al., 1989), 0.1% N-laurylsarcosine (w/v); 0.2% SDS (w/v); 1% blocking reagent (Boehringer Mannheim). The membrane was then washed twice for five minutes in 2×SSC (Sambrook et al., 1989) and 0.1% SDS at ambient temperature and twice for 15 minutes in 0.1×SSC, 0.1% SDS at 68° C. The hybridized probes were then immunologically detected with antidigoxygenin Fab fragments conjugated to alkaline phosphatase and a chemiluminescent CSPD substrate (Boehringer Mannheim).

[0091] As can be seen in FIG. 5, different copies of the senX3-regX3 intercistronic region are present in M. tuberculosis (IPL), as well as in M. bovis BCG (IPP).

Example 5 Amplification by PCR of the senX3-regX3 Intercistronic Region of Different Strains of Mycobacteria

[0092] Given that the intercistronic region separating the senX3 and regX3 genes has variations in length between M. bovis BCG (IPP), M. tuberculosis (IPL) and M. leprae, the authors of the present invention analysed the corresponding region in other strains of M. bovis (including non-BCG strains) and of M. tuberculosis. They likewise analysed other species of mycobacteria: (i) the other members of the M. tuberculosis complex: M. africanum and M. microti and (ii) the mycobacteria other than those of the M. tuberculosis complex.

[0093] The strains analysed are indicated Tables 1 and 2. Their chromosomal DNA was prepared as indicated below. The mycobacteria were recovered by centrifugation (300 revs/min; 30 minutes), starting from 100 ml of culture and incubation for 1 hour at 37° C. in 10 ml of phosphate buffer (0.4 M sucrose, 10 mM EDTA, Tris-HCl (pH 8) 10 mM, 4 mg/ml of lysozyme). The protoplasts obtained were recovered by centrifugation (3000 revs/min; 20 minutes) and then lysed by placing them to incubate for 1 hour at 60° C. in 6 ml of L buffer (10 mM NaCl, SDS 6%, 10 mM Tris-HCl (pH 8), 500 μg/ml of proteinase K). After addition of 1.5 ml of 5 M NaCl, the mixture was centrifuged (14,000 revs/min., 20 minutes). The supernatant was subjected to extraction with phenol-chloroform and the DNA was precipitated with isopropanol. The resuspended pellet was then treated with RNase, extracted with phenol-chloroform and chloroform and then precipitated with ethanol. The pellet was then dried in air and resuspended in 100 μl of TE buffer (10 mM Tris-HCl, 1 mM EDTA, pH 7.6).

[0094] The primers for PCR analysis were chosen from the DNA sequence of the senX3 and regX3 genes of M. bovis BCG (IPP) and M. tuberculosis (IPL). A primer (C5) hybridized to the 3′ end of senX3 gene and had the following 5′-3′ sequence: 5′GCGCGAGAGCCCGAACTGC3′. The other primer (C3) hybridized to the 5′ end of the regX3 gene and had the following 5′-3′ sequence: 5′-GCGCAGCAGAAACGTCAGC-3′. With these two primers, the PCR product comprises 56 additional base pairs at the 5′ end of the intergenic region and 62 additional base pairs at its 3′ end with respect to the intercistronic region. The PCR product has a length of 369 base pairs for M. tuberculosis 22962067 (IPL) and 276 base pairs for M. bovis BCG (IPP).

[0095] The PCR amplifications were carried out in a thermocycler (Perkin Elmer) by incubation of 1 μl of chromosomal DNA with the following reaction mixture: C5 and C3 oligonucleotides (1 μg/1 μl of each), dTNP (2 μl) 25 mM, TMACl (tetramethylammonium chloride) (2 μl) 5 mM, 10×enzymatic buffer (10 μl), Vent DNA polymerase (1 U/0.5 μl), water (82 μl). The amplification was carried out over 30 cycles at 94° C. for one minute, 65° C. for one minute and 72° C. for one minute. 10 μl of PCR product were then subjected to electrophoresis on 2.5% agarose gel and visualized with ethidium bromide. The negative controls containing all the PCR reagents except the DNA matrix were treated in parallel with the samples. The length of the PCR product was estimated by comparison with the DNA scale of 1 kb.

[0096] No PCR product was detected for the 11 strains of non-tuberculosis mycobacteria tested (see Table 2). Three strains of Streptomyces (S. cacaoi, S. R61 and S. R39), as well as E. coli TG1 likewise gave negative results in PCR. On the other hand, PCR products were obtained with all the members of the M. tuberculosis complex. The specificity of the PCR products was confirmed in all the cases by Southern Blot analysis using as probe the senX3-regX3 intergenic region of M. tuberculosis (IPL) labelled with digoxygenin, as described in Example 4.

[0097] The length of the amplified fragments is indicated in Table 3. Eight different groups with different amplicon lengths were obtained. Out of the 35 clinical isolates of M. tuberculosis tested, 34 gave PCR fragments of 329 base pairs which could not be distinguished from that obtained with the reference strain M. tuberculosis 22962067 (IPL) (group 5). One strain of M. tuberculosis (No. 19) gave a PCR product of 254 base pairs (Group 8). M. tuberculosis S200 also belongs to group 5.

[0098] Strains of M. tuberculosis from Vietnam and not containing the IS6110 sequence (V. 808, V. 761 and V. 729) were different from the major M. tuberculosis group (group 5). The length of their PCR product exceeded 500 base pairs (one product of +/−500 base pairs and two of 560). The laboratory strains of M. tuberculosis H37Ra and H37Rv had PCR products which were slightly larger than those obtained in group 5 of the strains of M. tuberculosis and these two strains are classified among group 4. These two strains are odd in as far as the uncertain pathogenicity for H37Rv and the loss of pathogenicity of H37Ra are concerned. In addition, no clinical case of M. tuberculosis was present in groups 4, 6, 8: these three groups are thus specific for the non-virulent strains.

[0099] The BCG strains (vaccine strains and isolates of clinical cases of BCGites) were divided into three groups (No. 4, 6 and 8). The PCR products obtained from these strains had a length of 353 base pairs, 276 and 199 respectively.

[0100] The most important variability at the level of the length of the PCR fragments was encountered for the non-BCG strains of M. bovis. Six strains were tested. Three of them, including the reference strains AN5, gave PCR products of 408 base pairs (group 3). The three other strains were grouped in groups 2, 5 and 7 corresponding to PCR fragments of approximately 500 and of 329 or 254 base pairs respectively. TABLE 1 M. tuberculosis complex Species Strains Sources M. tuberculosis reference (22962067) IPL* H37Ra IPP** H37Rv IPP (Marchal) S200 CHR of Lille*** 35 clinical isolates CHR of Lille (different profiles of IS6110 RFLP) 3 clinical isolates IPP (Marchal) from Vietnam: v808, V761, V729 M. bovis BCG (vaccine strain3): IPP BCG IPP IPL BCG Moreau IPL BCG Japonicu3 IPP BCG Pragues IPP BCG Montreal IPP BCG Russe IPP BCG Glaxo case of BCGites: 4 clinical isolates IPL (28, 29, 30, 31) CHR of Lille non BCG: reference ANS IPP (Marchal) 1 clinical isolates (1) CHR of Lille 2 isolates from goats University of (60, 63) Zaragoza (Martin) 2 isolates from cows University of (76, 78) Zaragoza (Martin) M. africanum clinical isolate M. microti reference ATCC 19422 CHR of Lille IPP (Marchal)

[0101] TABLE 2 Atypical mycobacteria Species and strains Sources M. aurum clinical isolates (IPL) M. avium clinical isolates (IPL) M. chelonae clinical isolates (IPL) M. flavescens clinical isolates (IPL) M. fortuitum clinical isolates (IPL) M. kanasii clinical isolates (CHR of Lille) M. marinum clinical isolates (IPL) M. scrotulaceum clinical isolates (IPL) M. smegmatis IPL M. terae clinical isolates (IPL) M. xenopi clinical isolates (IPL)

[0102] TABLE 3 Amplification by PCR of the senX3-regX3 intergenic region and sequencing Estimated length of the product Composition of obtained the senX3-regX3 Strains of mycobacteria during PCR intergenic region 1 M. microti 560 bp 77 77 77 77 77 53 M. tuberculosis V.808 -->-->-->-->-->-> V.761 2 M. tuberculosis V.729 ±500 bp  M. bovis 60 (goat) 3 M. bovis 63 (goat) 406 bp 77 77 77 53 78 (cow) -->-->-->-> AN5 4 M. tuberculosis H37Rv (IPP) 353 bp 77 77 77 M. tuberculosis H37Ra (IPL) -->-->-> BCG: BCG Japonicus BCGite 28 BCGite 30 5 M. africanum 329 bp 77 77 53 M. bovis 76 (cow) -->-->-> M. tuberculosis: reference IPL S200 34 clinical isolates. 6 BCG: IPP reference 276 bp 77 77 BCG Moreau -->--> BCG Russe BCG Glaxo BCGite 29 BCGite 31 7 M. bovis I 254 bp 77 53 M. tuberculosis No. 19): one of -->-> the 35 clinical isolates of the CHR of Lille tested 8 BCG: BCG Pregues 199 bp 77 BCG Montreal --->

[0103] The amplified DNA fragments comprise senX3-regX3 intercistronic region as well as 56 base pairs upstream of this and 62 base pairs downstream. The arrows 77 and 53 designate the repeated elements found in each group. The strains for which the intercistronic region was sequenced are underlined.

BIBLIOGRAPHY

[0104] Barnes, P. F., A. B Bloch, P. T. Davidson, and D. E. Snider, 1991. Tuberculosis in patients with human immunodeficiency virus infection. N. Engl. J. Med. 324: 1644-1650.

[0105] Brisson-Noel, A., D. Lecossier, X. Nassif, B. Giquel, V. Levy-Frebault, and A. J. Hance. 1989. Rapid diagnosis of tuberculosis by amplification of mycobacteria DNA in clinical samples. Lancet ii: 1069-1072.

[0106] Brisson-Noel, A., C. Aznar, C. Chureau, S. Nguyen, C. Pierre, M. Bartoli, R. Bonete, G. Plaloux, B. Cicquel, and G. Garrigue. 1991. Diagnosis of tuberculosis by DNA amplification in clinical practice evaluation. Lancet 338: 364-366.

[0107] Cave, M. D., K. D. Eisenach, P. F. McDermott, J. I. I. Bates, and J. T. Crawford, 1991. IS6110: conservation of sequence in the Mycobacterium tuberculosis complex and its utilization in DNA fingerprinting. Mol. Cell. Probes 5: 73-80.

[0108] Clarridge, J. E., R. M. Shawar, T. Shinnick, and B. B. Plikaytis; 1993. Large-scale use of polymerase chain reaction for detection of Mycobacteria tuberculosis in a routine mycobacteriology laboratory. J. Clin. Microbiol. 31: 2049-2056.

[0109] Cousins, D. V., S. D. Wilton, B. R. Francis and B. L. Beth. 1992. Use of PCR for rapid diagnosis of tuberculosis. J. Clin. Microbiol. 30: 255-258.

[0110] Eisenbach, K. O., D. Cave, J. I. I. Bates, and J. T. Crawford. 1990. Polymerase chain reaction amplification of a repetitive DNA sequence specific for Mycobacterium tuberculosis. J. Infect. Dis. 161: 977-981. Folgueira, L., R. Delgado, E. Palenquo, and A. R. Noriega. 1993. Detection of Mycobacterium tuberculosis DNA in clinical samples by using a simple lysis method and PCR. J. Clin. Microbiol. 31: 1091-1021.

[0111] Forbes, B. A. and K. Hicks. 1993. Direct detection of Mycobacteria tuberculosis in respiratory specimens in a clinical laboratory by P.C.R. J. Clin. Microbiol. 31: 1688-1694.

[0112] Fries, J. W. U., R. J. Patel, W. F. Piessens, and D. F. Wirth. 1991. Detection of untreated mycobacteria by using PCR and specific DNA probes. J. Clin. Microbiol. 29: 1744-1747.

[0113] Hermans, P. W. M., A. R. J. Schullema, D. van Soolingen, C. P. H. J. Verstynen, E. M. Bik, J. E. R. Thole, A. H. J. Kolk and J. D. A. van Embden. 1990. Specific detection of Mycobacterium tuberculosis complex strains by PCR, J. Clin. Microbiol. 28: 1204-1213.

[0114] Imaeda, T. 1985. Deoxyribonucleic acid relatedness among selected strains of Mycobacterium tuberculosis, M. bovis, M. bovis BCG, M. microti, and M. africanum. Int. J. Syst. Bacteriol. 35: 147-150.

[0115] Jonas, J. Clin. Microbiol., M. J. Alden, and J. I. Curry. 1993. Detection and identification of Mycobacteria tuberculosis directly from sputum sediments by amplification of rRNA. J. Clin. Microbiol. 31: 2410 2416.

[0116] Kolk, A. H. I., A. R. J. Schuitema, S. Kuiper, V. van Leeuwen, P. W. M. Hermans, J. D. A. van Embden, and R. A. Hartskeerl. 1992. Detection of Mycobacterium tuberculosis in clinical samples by using PCR and a nonradioactive detection system. J. Clin. Microbiol. 30: 2567-2575.

[0117] Kox, L. F. F., J. van Leeuwen, S. Knijper, H. M. Jansen, and A. H. J. Kolk. 1995. PCR assay based on DNA coding for 16S rRNA for detection and identification of mycobacteria in clinical samples. J. Clin. Microbiol. 33: 3225-3233.

[0118] Kremer. L., A. Baulard J. Estaquier, O. Poulain-Godefroy, and C. Locht. 1995a. Green fluorescent protein as a new expression marker in mycobacteria. Mol. Microbiol. 17: 913-922.

[0119] Krerner. L., A. Baulard, J. Estaquier, J. Content, A. Capron, and C. Locht. 1995b. Analysis of the Mycobacterium tuberculosis 85A antigen promoter region. J. Bacteriol. 177: 642-653.

[0120] Noordhoek, G. O., A. H. J. Kolk, G. Bjune, D. Catty, J. W. Dalc, P. E. Fine, P. Godfrey-Faussett, S-N. Cho, T. Shinnick, S. B. Svenson, S. Wilson, and J. D. A. van Embden. 1994. Sensitivity and specificity of PCR for detection of Mycobacterium tuberculosis: a blind comparison study among seven laboratories. J. Clin. Microbiol. 32: 277-204.

[0121] Parkinson, J. S. and Kofoid, E. C. 1992. Communication modules in bacterial signaling proteins. Annu. Rev. Genet. 26, 71-112.

[0122] Pierre. C., D. Lecossier, Y. Boussougnant, D. Bocart, V. Joly, P. Veni, and A. Hance. 1991. Use of a reamplification protocol improves sensitivity of detection of Mycobacterium tuberculosis in clinical samples by amplification of DNA. J. Clin. Microbiol. 29: 712-717.

[0123] Saboor, S. A., N. Y. Johnson, and J. McFadden. 1992. Detection of mycobacteria DNA in sarcoidio and tuberculosis with PCR. Lancet 339; 1012-1015.

[0124] Saiki, R. K., D. H. Gelfand, S. Stoffel, S. J. Scarf, R. Higuchu, G. T. Horn, K. E. Mullis, and H. A. Erlich. 1988. Primer-directed enzymatic amplification of DNA with a thermostable DNA polymerase. Mol. Cell. Probes 5: 515-219.

[0125] Sambrook, J., E. F. Fritsch, and T. Maniatis, 1989. Molecular cloning: a laboratory manual, Cold Spring Harbor, N.Y.

[0126] Sauton, S. 1912. Sur la nutrition minérale du bacille tuberculeux (On the mineral nutrition of the tuberculosis bacillum). C. R. Acad. Sci Paris. 155:860.

[0127] Shankhar, S., N. Manjunath, K. K. Mohan, K. Praasad, M. Behari, G. K. Shriniwas, and K. Ahuja. 1991. Rapid diagnosis of tuberculosis meningitis by PCR. Lancet: 5-7.

[0128] Sjöbring, U., M. Mecklenburg, A. B. Andersen, and H. Mijörner. 1990. PCR for detection of Mycobacterium tuberculosis. J. Bacteriol. 169: 1080-1088.

[0129] Sudre, P., G. en Dam, and A. Kochi. 1992. Tuberculosis: a global overview of the situation today. Bull. W.H.O. 70: 149-159.

[0130] Telenti, A., F. Marchesi, M. Balz, F. Bally, E. C. Bottger, and T. Bodmer. 1993. Rapid identification of mycobacteria to the species level by polymerase chain reaction and restriction enzyme analysis. J. Clin. Microbiol. 31: 175-178.

[0131] van Soolingen, D., P. E. W. de Haas, P. W. M. Hermans, P. M. Groenen, and J. D. A. van Embden. 1993. Comparison of various repetitive DNA elements as genetic markers for strain differentiation and epidemiology of Mycobacgerium tuberculosis. J. Clin. Microbiol. 31: 1987-1995.

[0132] van Soolingen, D., P. E. W. de Haas, J. Haagsma, T. Eger, P. W. H. Hermans, V. Ritacco, A. Alito, and J. D. A. van Embden. 1994. Use of various genetic markers in differentiation of Mycobacterium bovis from animals and humans and for studying epidemiology of bovine tuberculosis. J. Clin. Microbiol. 32: 2425-2433.

[0133] Wren, B. W., Colby, S. M., Cubberley, R. R. and Palten, M. J. 1992. Degenerate PCR primers for the amplification of fragments from genes encoding response regulators from a range of pathogenic bacteria FEMS Lett. 99, 287-292.

1 21 1 77 DNA Mycobacterium tuberculosis 1 atgacctgcg ccgacgacga tgcagagcgt agcgatgagg tgggggcacc acccgcttgc 60 gggggagagt ggcgctg 77 2 53 DNA Mycobacterium tuberculosis 2 atgacctgcg ccgacgacga tgcagagcgt agcgatgagg aggagtggcg ctg 53 3 52 DNA Mycobacterium leprae 3 atgacacccg cgcaggcgat gatgcagagc gaagtgacga gagggaatgt ga 52 4 19 DNA Mycobacterium tuberculosis 4 gcgcgagagc ccgaactgc 19 5 19 DNA Mycobacterium tuberculosis 5 gcgcagcaga aacgtcagc 19 6 23 DNA Mycobacterium bovis 6 cgaggagtcg ctggccgatc cgc 23 7 22 DNA Mycobacterium bovis 7 agcgccccag ctccagccga cc 22 8 21 DNA Mycobacterium bovis 8 tggcgtagtg tgtgacttgt c 21 9 21 DNA Mycobacterium bovis 9 gaccagacag tcgccaaggt t 21 10 20 DNA Mycobacterium tuberculosis 10 aaacacgtcg cggctaatca 20 11 21 DNA Mycobacterium tuberculosis 11 cctcaaagcc cctccttgcg c 21 12 3208 DNA Mycobacterium bovis 12 gaattccgct gcgctacgac ctggattccg cgatgaggcc gctggtgcgc ggtggtacgt 60 atctggaccc ggaggcggca gccgccggcg ccgccgcggt ggccggccag ggccgcgggt 120 aattgtttga gatcccacct gccggcggtt tcggcggctg atggtgtgct ttggtgcgct 180 gtttgccaaa cagcatgtga acggtaaccg aacagctgtg gcgtagtgtg tgacttgtcc 240 gattttggcc ttgccgcgct agggcgacgt tcaccggatt tgtaggattt tccttgtgac 300 tgtgttctcg gcgctgttgc tggccggggt tttgtccgcg ctggcactgg ccgtcggtgg 360 tgctgttgga atgcggctga cgtcgcgggt cgtcgaacag cgccaacggg tggccacgga 420 gtggtcggga atcacggttt cgcagatgtt gcaatgcatt gtcacgctga tgccgctggg 480 cgccgcggtg gtggacaccc atcgcgacgt tgtctacctc aacgaacggg ccaaagagct 540 aggtctggtg cgcgaccgcc agctcgatga tcaggcctgg cgggccgccc ggcaggcgct 600 gggtggtgaa gacgtcgagt ccgacctgtc gccgcgcaag cggtcggcca cgggtcgatc 660 cgggctatca gtgcatgggc atgcccggtt gctgagcgag gaagaccgcc ggttcgccgt 720 ggtgttcgtg cacgaccagt cggattatgc gcggatggag gcggctaggc gtgacttcgt 780 ggccaacgtc agtcacgagc tcaagacgcc cgtcggtgcc atggctctac tcgccgaggc 840 gctgctggcg tcggccgacg actccgaaac cgttcggcgg ttcgccgaga aggtgctcat 900 tgaggccaac cggctcggtg acatggtcgc cgagttgatc gagctatccc ggctacaggg 960 cgccgagcgg ctacccaata tgaccgacgt cgacgtcgat acgattgtgt cggaagcgat 1020 ttcacgccat aaggtggcgg ccgacaacgc cgacatcgaa gtccgcaccg acgcgcccag 1080 caatctgcgg gtgctgggcg accaaactct gctggttacc gcactggcaa acctggtttc 1140 caatgcgatt gcctattcgc cgcgcgggtc gctggtgtcg atcagccgtc gccgtcgcgg 1200 tgccaacatc gagatcgccg tcaccgaccg gggcatcggc atcgcgccgg aagaccagga 1260 gcgggtcttc gaacggttct tccgggggga caaggcgcgc tcgcgtgcca ccggaggcag 1320 cggactcggg ttggccatcg tcaaacacgt cgcggctaat cacgacggca ccatccgcgt 1380 gtggagcaaa ccgggaaccg ggtcaacgtt caccttggct cttccggcgt tgatcgaggc 1440 ctatcacgac gacgagcgac ccgagcaggc gcgagagccc gaactgcggt caaacaggtc 1500 acaacgagag gaagagctga gccgatgacc tgcgccgacg acgatgcaga gcgtagcgat 1560 gaggtggggg caccacccgc ttgcggggga gagtggcgct gatgacctgc gccgacgacg 1620 atgcagagcg tagcgatgag gtgggggcac cacccgcttg cgggggagag tggcgctgat 1680 gaccagtgtg ttgattgtgg aggacgagga gtcgctggcc gatccgctga cgtttctgct 1740 gcgcaaggag ggctttgagg ccacggtggt gaccgatggt ccggcagctc tcgccgagtt 1800 cgaccgggcc ggcgccgaca tcgtcctgct cgatctgatg ctgcctggga tgtcgggtac 1860 cgatgtatgc aagcagttgc gcgctcggtc cagcgttccg gtgatcatgg tgaccgcccg 1920 ggatagcgag atcgacaagg tggtcggcct ggagctgggc gctgacgact acgtgaccaa 1980 gccctattcg gcacgcgagt tgatcgcacg catccgcgcg gtgctgcgcc gtggcggcga 2040 cgacgactcg gagatgagcg atggcgtgct ggagtccggg ccggttcgca tggatgtgga 2100 gcgccatgtc gtctcggtga acggtgacac catcacgctg ccgctcaagg agttcgacct 2160 gctggaatac ctgatgcgca acagcgggcg ggtgttgact cgcggacaac tgatcgaccg 2220 ggtctggggt gcggactacg tgggcgacac caagacgctc gacgtccatg tcaagcggct 2280 gcgctccaag atcgaagccg acccggctaa cccggttcac ttggtgacgg tgcgcgggct 2340 gggctacaaa ctcgagggct agcggacgcc gacaaccttg gcgactgtct ggtcggctac 2400 ggccagtgcc atcgccatga tggacagctg cgggttcact tccgggcagc tgggcaggat 2460 cgaggcgtcg gcaacccaca cgccctcgac gccgcgcagc cggcccgtcg cgtcgaccgg 2520 acaaagctgc tcgtcggcgc cggcggccgc ggtgcccgtc ggatggaagg cggccaggtg 2580 caggcttctg gggttggctc ggcgcagcac atcctgcagc tcgggcaggg accgcatcgg 2640 tggggcgccg gggataccgg tcagcacctc caccgcgccg gcggcaaaga acagccggcc 2700 aatggcctgc agcgcgaccc gtagcttggc gatctcacct ggagctatgt catagcgcac 2760 caccgtctcg ccgcgcaccg accgcaccgt gccgacgccc cgatcggcca ccatcgcccc 2820 gaatgttgcg atctgcggcg cccggtcgag ccagcggagc agctcggccc cgtagccggg 2880 gaagaccatc gaccccatgc ccggcggtgt ggaggtggcc tcgatcagca cgccgtcgga 2940 ttcgtgaaac tcgtgaaccg ccgcgctctg cagcaccccg cgccacgcga agacgtcgtc 3000 gtcgaagagc ccggccagca tagttgccgg gtgcagcgca aggttgtggc ccagtcgcgg 3060 tgcccaccaa gaccgctgcg ccgcaacagc cctggcgtct ccgtcgcacc ggcggcgacg 3120 acgaccgcgt cggccagcac gtcgagtgtg gtgccgtcgg gccggcgggc tcgcacgcca 3180 taggcccgcc cggcgcggtg caggatcc 3208 13 225 DNA Mycobacterium tuberculosis 13 gctgagccga tgacctgcgc cgacgacgat gcagagcgta gcgatgaggt gggggcacca 60 cccgcttgcg ggggagagtg gcgctgatga cctgcgccga cgacgatgca gagcgtagcg 120 atgaggtggg ggcaccaccc gcttgcgggg gagagtggcg ctgatgacct gcgccgacga 180 cgatgcagag cgtagcgatg aggaggagtg gcgctgatga ccagt 225 14 25 PRT Mycobacterium bovis BCG Translation of ORF nucleotides 1525-1602 of SEQ ID NO12. 14 Met Thr Cys Ala Asp Asp Asp Ala Glu Arg Ser Asp Glu Val Gly Ala 1 5 10 15 Pro Pro Ala Cys Gly Gly Glu Trp Arg 20 25 15 25 PRT Mycobacterium bovis BCG Translation of ORF nucleotides 1602-1679 of SEQ ID NO12. 15 Met Thr Cys Ala Asp Asp Asp Ala Glu Arg Ser Asp Glu Val Gly Ala 1 5 10 15 Pro Pro Ala Cys Gly Gly Glu Trp Arg 20 25 16 25 PRT Mycobacterium tuberculosis Translation of ORF nucleotides 10-87 of SEQ ID NO13. 16 Met Thr Cys Ala Asp Asp Asp Ala Glu Arg Ser Asp Glu Val Gly Ala 1 5 10 15 Pro Pro Ala Cys Gly Gly Glu Trp Arg 20 25 17 25 PRT Mycobacterium tuberculosis Translation of ORF nucleotides 87-164 of SEQ ID NO13. 17 Met Thr Cys Ala Asp Asp Asp Ala Glu Arg Ser Asp Glu Val Gly Ala 1 5 10 15 Pro Pro Ala Cys Gly Gly Glu Trp Arg 20 25 18 17 PRT Mycobacterium tuberculosis Translation of ORF nucleotides 164-217 of SEQ ID NO13. 18 Met Thr Cys Ala Asp Asp Asp Ala Glu Arg Ser Asp Glu Glu Glu Trp 1 5 10 15 Arg 19 39 PRT Mycobacterium bovis Translation of nucleotides 3-119 of SEQ ID NO12, partial PgmY amino acid sequence. 19 Ile Pro Leu Arg Tyr Asp Leu Asp Ser Ala Met Arg Pro Leu Val Arg 1 5 10 15 Gly Gly Thr Tyr Leu Asp Pro Glu Ala Ala Ala Ala Gly Ala Ala Ala 20 25 30 Val Ala Gly Gln Gly Arg Gly 35 20 410 PRT Mycobacterium bovis Translation of nucleotides 296-1528 of SEQ ID NO12, SenX3 amino acid sequence. 20 Met Thr Val Phe Ser Ala Leu Leu Leu Ala Gly Val Leu Ser Ala Leu 1 5 10 15 Ala Leu Ala Val Gly Gly Ala Val Gly Met Arg Leu Thr Ser Arg Val 20 25 30 Val Glu Gln Arg Gln Arg Val Ala Thr Glu Trp Ser Gly Ile Thr Val 35 40 45 Ser Gln Met Leu Gln Cys Ile Val Thr Leu Met Pro Leu Gly Ala Ala 50 55 60 Val Val Asp Thr His Arg Asp Val Val Tyr Leu Asn Glu Arg Ala Lys 65 70 75 80 Glu Leu Gly Leu Val Arg Asp Arg Gln Leu Asp Asp Gln Ala Trp Arg 85 90 95 Ala Ala Arg Gln Ala Leu Gly Gly Glu Asp Val Glu Ser Asp Leu Ser 100 105 110 Pro Arg Lys Arg Ser Ala Thr Gly Arg Ser Gly Leu Ser Val His Gly 115 120 125 His Ala Arg Leu Leu Ser Glu Glu Asp Arg Arg Phe Ala Val Val Phe 130 135 140 Val His Asp Gln Ser Asp Tyr Ala Arg Met Glu Ala Ala Arg Arg Asp 145 150 155 160 Phe Val Ala Asn Val Ser His Glu Leu Lys Thr Pro Val Gly Ala Met 165 170 175 Ala Leu Leu Ala Glu Ala Leu Leu Ala Ser Ala Asp Asp Ser Glu Thr 180 185 190 Val Arg Arg Phe Ala Glu Lys Val Leu Ile Glu Ala Asn Arg Leu Gly 195 200 205 Asp Met Val Ala Glu Leu Ile Glu Leu Ser Arg Leu Gln Gly Ala Glu 210 215 220 Arg Leu Pro Asn Met Thr Asp Val Asp Val Asp Thr Ile Val Ser Glu 225 230 235 240 Ala Ile Ser Arg His Lys Val Ala Ala Asp Asn Ala Asp Ile Glu Val 245 250 255 Arg Thr Asp Ala Pro Ser Asn Leu Arg Val Leu Gly Asp Gln Thr Leu 260 265 270 Leu Val Thr Ala Leu Ala Asn Leu Val Ser Asn Ala Ile Ala Tyr Ser 275 280 285 Pro Arg Gly Ser Leu Val Ser Ile Ser Arg Arg Arg Arg Gly Ala Asn 290 295 300 Ile Glu Ile Ala Val Thr Asp Arg Gly Ile Gly Ile Ala Pro Glu Asp 305 310 315 320 Gln Glu Arg Val Phe Glu Arg Phe Phe Arg Gly Asp Lys Ala Arg Ser 325 330 335 Arg Ala Thr Gly Gly Ser Gly Leu Gly Leu Ala Ile Val Lys His Val 340 345 350 Ala Ala Asn His Asp Gly Thr Ile Arg Val Trp Ser Lys Pro Gly Thr 355 360 365 Gly Ser Thr Phe Thr Leu Ala Leu Pro Ala Leu Ile Glu Ala Tyr His 370 375 380 Asp Asp Glu Arg Pro Glu Gln Ala Arg Glu Pro Glu Leu Arg Ser Asn 385 390 395 400 Arg Ser Gln Arg Glu Glu Glu Leu Ser Arg 405 410 21 227 PRT Mycobacterium bovis Translation of nucleotides 1679-2362 of SEQ ID NO12, RegX3 amino acid sequence. 21 Met Thr Ser Val Leu Ile Val Glu Asp Glu Glu Ser Leu Ala Asp Pro 1 5 10 15 Leu Thr Phe Leu Leu Arg Lys Glu Gly Phe Glu Ala Thr Val Val Thr 20 25 30 Asp Gly Pro Ala Ala Leu Ala Glu Phe Asp Arg Ala Gly Ala Asp Ile 35 40 45 Val Leu Leu Asp Leu Met Leu Pro Gly Met Ser Gly Thr Asp Val Cys 50 55 60 Lys Gln Leu Arg Ala Arg Ser Ser Val Pro Val Ile Met Val Thr Ala 65 70 75 80 Arg Asp Ser Glu Ile Asp Lys Val Val Gly Leu Glu Leu Gly Ala Asp 85 90 95 Asp Tyr Val Thr Lys Pro Tyr Ser Ala Arg Glu Leu Ile Ala Arg Ile 100 105 110 Arg Ala Val Leu Arg Arg Gly Gly Asp Asp Asp Ser Glu Met Ser Asp 115 120 125 Gly Val Leu Glu Ser Gly Pro Val Arg Met Asp Val Glu Arg His Val 130 135 140 Val Ser Val Asn Gly Asp Thr Ile Thr Leu Pro Leu Lys Glu Phe Asp 145 150 155 160 Leu Leu Glu Tyr Leu Met Arg Asn Ser Gly Arg Val Leu Thr Arg Gly 165 170 175 Gln Leu Ile Asp Arg Val Trp Gly Ala Asp Tyr Val Gly Asp Thr Lys 180 185 190 Thr Leu Asp Val His Val Lys Arg Leu Arg Ser Lys Ile Glu Ala Asp 195 200 205 Pro Ala Asn Pro Val His Leu Val Thr Val Arg Gly Leu Gly Tyr Lys 210 215 220 Leu Glu Gly 225 

1. Fragment of nucleic acids specific mycobacteria belonging to the M. tuberculosis complex, comprising a sequence of nucleotide selected from the sequence SEQ ID No. 1, the sequence SEQ ID No. 2, their complementary sequences or the sequences of nucleic acids capable of hybridizing with one of the preceding sequences under conditions of high stringency.
 2. Fragment of nucleic acids specific to the M. tuberculosis complex, comprising a sequence of nucleotides selected from the sequence SEQ ID No. 1, its complementary sequence or the sequences of nucleic acids capable of hybridizing with one of the preceding sequences under conditions of high stringency.
 3. Fragment of nucleic acids specific to members of the M. tuberculosis complex which are different from BCG, comprising a sequence of nucleotides selected from the sequence SEQ ID No. 2, its complementary sequence or the sequences of nucleic acids capable of hybridizing with one of the preceding sequences under conditions of high stringency.
 4. Cloning and/or expression vector containing a sequence of nucleic acids according to claim
 1. 5. Vector according to claim 4, characterized in that it is the plasmid pRegX3Bcl or PRegX3Mt1 respectively deposited at the CNCM under the numbers I-1765 and I
 1766. 6. Nucleotide probe or nucleotide primer characterized in that it hybridizes specifically with any one of the sequences according to claim 1, the corresponding RNA sequences or the corresponding genes.
 7. Nucleotide probe according to claim 6, comprising 24 consecutive nucleotides selected from the sequences of nucleic acids according to claim
 1. 8. Nucleotide probe according to claim 6, characterized in that is comprises the sequence SEQ ID No. 1 or its complementary strand.
 9. Nucleotide probe according to claim 6, characterized in that it comprises two successive sequences SEQ ID No. 1, followed by a sequence SEQ ID No.
 2. 10. Nucleotide probe for the detection of specific sequences of nucleic acids of members of the M. tuberculosis complex which are different from BCG, characterized in that it is a sequence corresponding to the region of the sequence SEQ ID No. 2 surrounding the GAG codon in the positions 40 to 42 or of its complementary strand.
 11. Nucleotide probe according to claim 10, characterized in that it is a sequence composed of 9 base pairs upstream and 9 base pairs downstream of the GAG codon in the specific positions 40 to 42 of the sequence SEQ ID No.
 2. 12. Nucleotide probe according to claim 10, characterized in that it is the sequence SEQ ID No. 2 or its complementary strand.
 13. Nucleotide probe according to claim 6, characterized in that it is labelled by dioxygenin.
 14. Nucleotide primers for the amplification of a specific nucleotide sequence of mycobacteria belonging to the M. tuberculosis complex, comprising nucleotide sequences corresponding to the sequences adjacent to the senX3-regX3 intergenic region, in the regions 3′ of senX3 and 5′ of regX3.
 15. Primers according to claim 14, characterized in that they comprise 19 nucleotides.
 16. Primers according to claim 14, characterized in that they are the pair of primers 5′GCGCGAGAGCCCGAACTGC3′ and 5′GCGCAGCAGAAACCTCAGC3′.
 17. Use of a sequence according to claim 1, for the production of diagnostic nucleotide probes or of nucleotide primers which can be used in an enzymatic amplification method.
 18. Use of a probe according to any one of claims 6 to 13 as an in vitro tool for detection or for diagnosis of strains of mycobacteria belonging to the M. tubercu losis complex.
 19. Method of detection of strains of mycobacteria belonging to the M. tuberculosis complex in a biological sample, comprising the following steps: (i) contacting the biological sample with a pair of primers according to any one of claims 6, 14 to 16 under conditions allowing hybridization of the said primers to the specific nucleic acids of strains of mycobacteria belonging to the M. tuberculosis complex; (ii) amplification of the said nucleic acids; (iii) contacting a nucleotide probe according to any one of claims 6 to 13 with the said biological sample under conditions allowing the formation of hybridization complexes between the said probe and the amplified sequences of nucleic acids; (iv) detection of the hybridization complexes formed.
 20. Method according to claim 19, characterized in that step (iii) is carried out with a nucleotide probe according to claim
 8. 21. Method of detection of the presence of members of the M. tuberculosis complex other than BCG in a biological sample according to claim 19, characterized in that step (iii) is carried out with a nucleotide probe according to claim
 10. 22. Method of detection and of differential diagnosis of BCG and of other members of the M. tuberculosis complex in a biological sample, characterized in that a detection method according to claim 20 is carried out and in a search is made among the amplified nucleic acids capable of forming hybridization complexes those are found which are likewise capable of forming hybridization complexes with a nucleotide probe according to claim
 10. 23. Method according to claim 21 or claim 22 for differentiating an infection by BCG from an infection by a virulent mycobacterium of the M. tuberculosis complex in an immunodeficient subject.
 24. Method according to claim 23, characterized in that the immunodeficient subject is a subject infected with HIV.
 25. Method for the identification of groups of mycobacteria belonging to the M. tuberculosis complex, characterized in that: the DNA of the said strains previously extracted with a pair of primers according to any one of claims 6, 14 to 16 is contacted under conditions allowing a specific hybridization of the primers with one of the sequences according to claim 1 and the obtainment of amplification products, and the length of the amplification products obtained is measured.
 26. Method according to claim 25, characterized in that the pair of primers according to claim 16 is used.
 27. Kit for the in vitro identification of strains of mycobacteria belonging to the M. tuberculosis complex in a biological sample comprising: a pair of primers according to any one of claims 6, 14 to 16; the reagents necessary to allow the amplification of the sequences of nucleic acids. 